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Abstract 



We construct a Hamiltonian for a single domain protein where the 
contact enthalpy and the chain entropy decrease linearly with the number 
^ . of native contacts. The hydration effect upon protein unfolding is included 

5^ \ by modeling water as ideal dipoles that are ordered around the unfolded 

surfaces, where the influence of these surfaces, covered with an "ice-like" 



CN ■ shell of water, is represented by an effective field that directs the water 

- dipoles. An intermolecular pair interaction between water molecules is 

also introduced. The heat capacity of the model exhibits the common 
feature of small globular proteins, two peaks corresponding to cold and 
warm unfolding, respectively. By introducing vibrational modes, we obtain 
quantitatively good accordance with experiments. 
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X ■ 1 Introduction 

A protein is a large polymer consisting of many thousands of atoms, and may 
therefore from a physical point of view be regarded as a macroscopic system 
Anfinsen proved that the one-dimensional sequence of amino residues deter- 
mines uniquely the three-dimensional conformation of the protein. Further he 
concluded that the native (folded) state is the state of the lowest free energy. 
Consequently, given a polypeptide sequence, a microscopic analysis of its en- 
thalpy, and degrees of freedom, followed by a statistical mechanical evaluation 
should reveal its thermodynamical properties of a given protein. 

Proteins are known to fold on time scales from milliseconds to seconds, in 
great contrast to a statistical analysis based on the vast number of conformations 
in the energy landscape which indicates an astronomical folding time. One way 
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to circumvent this paradox, attributed to Levinthal [Q, is to suppose some kind of 
folding pathway |]^-|10| . The latter means that the folding process, starting from a 
denaturated (unfolded) conformation, follows a specific sequence of folding steps 
in the free energy landscape, until the native state is reached. 

Proteins consist of 20 different amino acids with a great diversity with regard 
to size, polarity and charge. Considering the electrostatics it is possible to argue 
that the small number of charges does not contribute significantly to the stability 
of the native conformation Thus, two kinds of surfaces remain most relevant, 
the apolar (hydrophobic) and polar surfaces 

In this work we model a small single-domain protein consisting of energeti- 
cally equal contacts that lower their energy when they are closed, and it decreases 
linearly with the number of "native like" contacts. In addition, we introduce wa- 
ter molecules, modeled as ideal electrical dipoles in an external effective field 
that represent the influence or ordering effect of the unfolded apolar surfaces on 
the water. Besides, there are pair interactions between water molecules. In a 
self-consistent mean field treatment they add to the effective field |]T2|,|TB[. By 
performing statistical mechanical evaluations we then find thermodynamic func- 
tions like the free energy and the heat capacity of the protein. In the end we 
assign vibrational modes in the IR-region, and compare the heat capacity to 
experimental results on metmyoglobin [|14 



2 The protein model 

In this section we will present the statistical model which is a further development 
of earlier models of Hansen et al. and Bakk et al. [§,0. The chain-chain 
interactions, although reformulated, are used as in these models, but the protein- 
water interactions are new compared to these models. Thus, we will spend the 
greater part of this section discussing protein-water and water-water interactions. 



2.1 Internal forces in the protein 

For simplicity the protein is regarded as consisting of contacts A contact 
is a conformation with a specified free energy. Beyond this specification the model 
does not have a detailed connection to the structure of proteins. However, the 
general character of the model makes it possible to reveal various key features 
about the specific mechanism of protein folding thermodynamics that can lead 
to cold and warm unfolding. Each contact is supposed to be a specific point 
on a folding pathway P-p!0|. The pathway is also in accordance with a "folding 
funnel" |jl6|, where each contact now represents the intersection between a level 
or a "contour line" in the free energy landscape and one of the possible multiple 
pathways ||17|| . 

Upon folding, the protein lowers its enthalpy by an amount ec for each contact 



2 



that "folds" |T8[. Let i G {0, 1, ... , A^} be the number of contacts that are correctly 



folded. Thus the resulting energy for the bare or vacuum chain-chain interactions 
can be written 

Kl = -te,. (1) 

The specific value i = means an unfolded (denaturated) protein, while i = N 
is a complete folded (native) protein. To incorporate the rotational freedom or 
flexibility in the polypeptide backbone, we assign Qc degrees of freedom for each 
unfolded contact, where Qc is interpreted as the relative increase of the degrees 
of freedom for an unfolded contact compared to a folded one. Consequently the 
folding of i contacts corresponds to g^'^ degrees of freedom. It is worth noting 
that this chain-chain interaction model can be viewed as a further simplification 
of the model by Zwanzig |]19|. 



2.2 Hydration effect upon unfolding 

It is known from experiments that the heat capacity change upon aqueous disso- 
lution of apolar molecules from their gaseous state is positive and proportional 
to the solute molecule concentration [^,|2ll], and this change decreases with in- 



creasing temperature [Q. Further a solution of an apolar substance in water is 
associated with a negative entropy change at room temperature, which decreases 
in absolute value with increasing temperature |^. In other words, there seems 
to be an ordering of the water around the apolar surfaces. In sum the hydration 
effect of an apolar molecule can be explained by a gradual melting of an ordered 
"ice-like" shell around these compounds p3[. Melting of ice is a complex process 
whereupon conformational changes implies a change in the enthalpy and the en- 
tropy. In this paper we incorporate the hydration of the apolar surfaces, by an 
extension of a model proposed by Hansen et al. [|,|2^. 

The idea with the water interactions is to cover two basic properties of the 
model. First, there is an ordering of water around unfolded parts of the protein. 
This is accompanied by decreased enthalpy and entropy upon hydration of apolar 
molecules. Secondly, there are interactions between the water molecules. These 
interactions tend to orient the molecules with respect to each other to form an 
"ice-like" structure. The water molecules form hydrogen bonds at tetrahedral an- 
gles with neighboring water molecules. These bonds are associated with location 
of positive and negative charges within the water molecules. This again results in 
large permanent electric dipole moments of these molecules. Thus, we here as a 
simplification, approximate the water molecules by ideal electric dipoles. Apolar 
surfaces, in combination with hydrogen bonds, make it favorable for the water to 
make "ice-like" shell structures around these surfaces. The influence of these ap- 
olar surfaces we thus will model by an electric field E, that also has a structuring 
effect as it rectifies the dipole moments. This field yields an interaction for each 
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dipole 



T^E = -E- s = -Ecost?. (2) 

Here is s the dipole moment of the molecule where we for simplicity put |s| = 1, 
and is the angle between E and s. 

Besides, there will be pair interactions between neighboring molecules with 
the total energy 

Ti-p = — - Jij Si ■ Sj , (3) 

where is the coupling constants between water molecule i and neighboring 
molecules j. The factor 1/2 prevents double counting of interactions. In our 
model these can be regarded as interactions between the water dipole moments. 
We find reasons to take such interactions into account as formation of ice also 
represents a directional ordering of water molecules, and we want to investigate 
their influence. In Appendix we calculate the partition function for one water 
molecule, in Eq. |T3|, by a mean field approximation |T^,|T^ where the field E 
is replaced by an effective field Eg = E + bm. 

The resulting term i in the canonical partition function for the protein has i 
contact energies ec, and Qc degrees of freedom together with M "bound" water 
molecules each contributing a factor Z„ at each of the N — i unfolded contacts, 
thus 

Z, = g^-'e'^'^iA'K)''' {ZX^''-''^ , (4) 

where the factor 47r is the Z„ for E = for the Mi "unbound" bulk water 
molecules. 

Eq. ^ further can be rewritten as 

= e^^^^(47r)^^V-^, (5) 

where the function r, when inserting the Z^ from Appendix (Eq. |T^), is 



sinh pE(. 



(6) 



with a = l/gl^^\ and /i = ec/M. The parameters a and /x will depend upon the 
chemical environments (pH, denaturant concentration, etc.) 

The canonical partition function for the system is now simply the sum over 
Zi for the various contact conformations along the folding pathway 

Z = ^Z, = e^^--(47r)^^ ^ ^ ^ . (7) 

1=0 
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From the definition of tlie internal energy, U = —d{lnZ)/df3, it is clear from 
Eq. ^ tliat tlie exponential contributes with a constant factor to U, thus the heat 
capacity, C = —(3'^dU/d(3, will only depend on the function r in Eq. ^ 

The number of effective parameters is therefore, for a fixed system size (A^ = 
100 in this work), reduced to only four: a, 6, /x, and M, as E in Eq. ^ can be 
included in /3, i.e., one may put = 1, as we have done. 



2.3 Vibrational modes 

As the results will show, the model above yields a heat capacity that lacks certain 
features. Experimentally the heat capacity by hydration increases markedly with 
T, and the "valley" in the folded region is well above zero. Thus to account 
for these latter features we will introduce vibrational modes. These are internal 
modes of the protein due to couplings between neighboring atoms, and they can 
be considered as harmonic oscillators. The quantization of the latter yield the 
energy levels 

HM = {n + ^)hu, (8) 

where h = 6.63 • 10~^^ Js is Planck's constant and u is the frequency. Summing 
over all energy levels gives us the partition function for the vibrational modes for 
A^h independent harmonic oscillators 



-/3HkW \ = (2 sinh (rf/T))-^'^ , (9) 



where d = hv/{2k'B). We suppose, as a very simple assumption, that the vibra- 
tional modes are independent of the degree of folding, thus the partition function 
for the system including these, is 

Z' = Z^Z, (10) 
where Z is the partition function in Eq. |^. 



3 Results and discussion 

Whether the protein is folded or unfolded can now be analyzed in a straight- 
forward way by regarding the ratio r = Zj+i/Zj . The contribution Zi to the full 
partition function Z may be regarded as the partition function for a protein with 
i contacts folded. Thus, a free energy difference A^F between the denaturated 
(d) and the native (n) protein can be expressed as 

A^F = -T(lnZo -InZ^) = TA^lnr , (11) 
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which determines the stable conformation, and gives a direct interpretation of the 
function r. For A^F > or r > 1 the native conformation is thermodynamically 
stable, while for A^F < or r < 1 the denaturated conformation is stable. The 
r = 1 is a critical value, and in a small region around this value the protein 
switches between the two conformations. 

In Fig. we have plotted A^F as function of the temperature for different 
values of the "chemical potential like" parameter fi, while the other parameters 
are fixed. We see that for the three largest values of fi considered there is an 
interval in the middle where the native conformation is stable, while for low 
and high temperatures the denaturated protein is preferred. In other words, one 
has two unfolding transitions, a cold one and a warm one. This cold and warm 



unfolding seems to be a common feature of small globular proteins p5|,p6|. 

The smallest value fi^ = 1.74292 ... in Fig.|l]is a critical one, where the maxi- 
mum of the stability function is at A^F = 0, where the unfolded an folded states 
have equal probabilities. I.e., the lower curve of Fig. ^has a maximum 0.5. Qual- 
itatively the parabolic plots in Fig. |I| corresponds well to experiments of Privalov 
et al. on sperm whale metmyoglobin, where such conformational free energy 
differences were measured. 

The picture of cold and warm unfolding against different values of /i is sub- 
stantiated by a glance at Fig. H, that shows the mean number of folded contacts 
relative to the system size given by 

NYJ^=,Z, N (1 - r^+i) (1 - r) ^ ' 

For a complete denaturated protein n = 0, while for a native one n = 1. 

It is now interesting to study the heat capacity as done in Fig.^. We obtain 
two peaks, which show both cold and warm unfolding. The peaks vanish as 
H decreases towards its critical value. This feature is in accordance with the 



experiments of Privalov et al. |lT^,p5| on small globular proteins. 

By choosing d = 380 K and A'h = 67 as the number of oscillators for Z' (Eqs. 
H and 0), we see in Fig. ^ that the model is qualitatively in good correspondence 
with experimental data. We note the the specific choice d = 380K, corresponds 
to a wavelength ~ 2 ■ 10~^m, which is in the IR region. 

Finally in this section it can be noted that the coupling between the water 
molecules (see Eq. ^ resulting in the parameter b did not have significant influence 
upon the qualitative behavior of our results. We have used a non-zero value of b, 
but replacing it with zero while adjusting other parameters would result in minor 
changes of the results obtained. 
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4 Conclusion 



We have studied a single-domain protein, which is supposed to follow a specific 
folding pathway. The chain-chain contact enthalpy and entropy increase linearly 
with the degree of folding. Each individual water molecule is modeled as a dipole 
in an external electrical field. Between the dipoles there are interactions that are 
incorporated in a mean field approximation. 

We find that the protein in the native form is in an intermediate temperature 
region, while it becomes denaturated for low and high temperatures. This cold 
and warm unfolding behavior is in accordance with experiments on small globular 
proteins [0,|25|,^, and is common to earlier models by Hansen et al. ^,0; and 
Bakk et al. [1,0,13. 

By introducing vibrational modes, we find that the model also yields a more 
accurate quantitative representation of the heat capacity of proteins that undergo 



unfolding transitions at low and high temperatures 14,25 



Appendix 

The assumed pair interaction — Jij Sj • Sj between the dipole moment of water 
molecule i and its neighbors j is in a mean field consideration approximated 
by the term —bm ■ Sj, where Sj is replaced by its average < Sj >= m and 
b = J2j Jij- Such an approximation thus accounts to neglecting correlations 
between neighboring spins. The factor 6m can now be regarded as an added 



electric field by which one obtains an effective (mean) electric field [ll2|,|r 



Ee = E + bm (13) 

that acts upon independent (or free) spins. However, when adding effective fields 
on all spins, interactions are counted twice which is compensated by an energy 
l/2N^biv? for a system counting of A^w water molecules. Thus, in a mean field 
treatment the pair interaction energy in Eq. |^ is approximated by 

Hp -bm • ^ Si + -N^bm'^. (14) 

i 

The partition function for one water molecule becomes 

z„ = 6-1/2/36^2 z;, ^^^^ 

f3 = 1/T, with Boltzmanns constant absorbed in T, and 

Z; = 2n r d^sin^ef"^^'"'' = Att^^^^^. (16) 

Jo P-C'e 
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The polarization (or "magnetization") m is now obtained as |]T3 



m 



coth I3Eq 



d{(3E,) ^ I3E, 

Inserting Eq. in Eq. [T^, one obtain the relation between m and E. 



(17) 
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Figure captions 

Fig. 1. Temperature dependence of the denaturated and native protein free 
energy difference (A^F in Eq. |TT]) for different fi. Other parameters according to 
Eq. I are a = 0.12, 6 = 2, and M = 10. The absolute temperature is rescaled. 



Fig. 2. Temperature dependence of the order parameter n for the corresponding 
parameter set as in Fig. ^ Note that the maximum for /i4 is = 0.5, which 
corresponds to A^F = (see Fig.|l|). 

Fig. 3. Heat capacity for different /i. Other parameters as in Fig.|l|. Note 
the smoothing of the peaks for decreasing fi. The heat capacity for is the 
hydration contribution of the denaturated protein. 



Fig. 4. Heat capacity of metmyoglobin at different n based upon Z' in Eq. 
p!0| . Experimental data from Privalov et al. fl^ on metmyoglobin (Mb), where 
at pH = 3.50 corresponds to a denaturated protein. At pH = 4.10 the protein is 
folded between 20°C and 50°C, and has an unfolding transition at 70°C. One 
is also able to see from the data on pH = 4.10 some destabilizing action at low 
temperatures. 
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Figure 3: 
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Figure 4: 
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